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In this paper, we analyze single-sideband amplitude-modulation digital 
communication systems to develop a method for jointly and optimally 
setting the carrier phase and the automatic transversal equalizer of such 
systems. Mean-square equalization error is used as the performance 
criterion. We develop a simple receiver structure and study the convergence 
of the method. Exact locations of the stationary points in the parameter 
space are determined and the classifications of the stationary points are 
obtained. We show that the mean-square equalization error has only global 
minima and saddlepoints, but not local minima and maxima. Thus, the 
mean-square equalization error will converge to the absolute minimum by 
the proposed method, regardless of the initial settings of the parameters. 
A simple condition on the step sizes of the adjustments is also obtained 
which ensures the convergence of the process. Explicit formulas of the 
joint optimum, parameter settings and of the corresponding minimum 
mean-square error are obtained. For illustration purposes, a single-sideband 
digital communication system using a five- or nine-tap transversal equalizer 
is simulated on a computer. Both theory and simulation show that the 
equalization error depends critically on the carrier phase when the number 
of equalizer taps is not large, and that the minimum equalization error 
can be obtained by using the proposed method. 

I. INTRODUCTION 

In single-sideband amplitude-modulation digital communication sys- 
tems with transversal filter equalization, 1,2 the adjustment of the 
carrier phase is critical to the system's performance when the number 
of equalizer taps is not large. In this paper, a method is proposed for 
setting the carrier phase jointly with the automatic equalizer to minimize 
the mean-square equalization error. 
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We formulate a mathematical model of this study in Section II; 
in Sections III and IV, we analyze the system and develop a receiver 
structure. The problem of convergence is studied in Sections V and VI 
to determine if the equalization error will converge to the absolute 
minimum by the proposed method and whether such convergence 
depends on the initial settings of the parameters. We also consider 
step sizes of the adjustments. In Section VII, we derive explicit formulas 
for evaluating the system's performance. A voiceband data communica- 
tion system is simulated on a computer to test the proposed method 
and the results are described in Section VIII. 

II. MATHEMATICAL, MODEL 

As shown in Fig. 1, a single-sideband amplitude-modulation system 
is considered. When an impulse 5(0 is applied to the transmitter input, 
a signal a(t) is received at the receiving filter output. The Fourier 
transform of a(t) is denoted by A(j). (The Fourier transform of a 
function will be consistently denoted by the appropriate capital letter.) 
It is assumed that A(J) is band-limited between j x and / 2 , that is, 



A(j) ^0, only for U <\j\<U- 



(1) 



The signal a(t) is demodulated as shown in Fig. 1, where the demodulat- 
ing carrier frequency is / c . In single-sideband systems 
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Fig. 1 — An amplitude modulation system with coherent detection and transversal 
filter equalization. 
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/. ^ U (2) 

or 

U ^ U ■ (3) 

The demodulating carrier phase is denoted by 0. The demodulator 
output is a signal 7(0 with Fourier transform T(j). Since A(j) is band- 
limited, r(/) is also band-limited, that is, 

r(/) = 0, I / 1 > /„ (4) 

where /„ is / 2 — /, when / e is /, or f 2 (/o is larger than / 2 — /, if / c is less 
than /, or larger than / 2 ) . 

As shown in Fig. 1, the channel is equalized by a conventional trans- 
versal equalizer consisting oi 2N -\- 1 taps with gains e„ , n = — N to N, 
spaced at To-second intervals, where 

T = — seconds. (5) 

When an impulse 8(t) is applied at the transmitter input, the trans- 
versal equalizer output is s(t). Clearly s(f) is the overall impulse response 
of the system. The receiver parameters (that is, the demodulating 
carrier phase and the equalizer tap gains e n , n = — N to N) will be 
set to minimize the difference between the overall impulse response 
s(t) and a desired impulse response q(t). The familiar mean-square 
error criterion is used. That is, and e n , 11 = —N to N, will be jointly 
set to minimize the mean-square error 

So - f [«(0 - q(t)f dt. (6) 

J — oo 

For brevity, the tap gains e„ , n = —NtoN, will be abbreviated 
{e n \ in the sequel. 

III. ANALYSIS 

In this section, we analyze the system to develop a receiver structure 
for jointly setting 6 and |e„| by the method of steepest descent. 

In analyzing carrier signals, it is most convenient to use Hilbert 
transform techniques. As is well known, 1 the demodulator output y(t) 
is related to the input a(t) by 

7 (0 = cos {2*f e t + 9)a(t) + sin (2irf c t + 6)&(t) (7) 

where d(l) is the Hilbert transform of a(t). {When dealing with lengthy 
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time functions, we shall sometimes use the sign 30., that is, 3C[a(i)] = 

d(t).\ 

It is seen from Fig. 1 that 



N 

I 

—N 



s(l) = E e n y(t-nT -NT ). (8) 



We shall use the partial derivatives d& /dd and d8 /de„ in the method 
of steepest descent. Since q(t) is independent of 0, we obtain from 
equation (6) 



= f_2[s{t)-q(t)] d -^-dt. (9) 



In writing equation (9), the order of differentiation and integration 
has been exchanged [the underlying conditions for making such an 
exchange are easily satisfied by s(t) and q(t) encountered in communica- 
tion systems]. Substituting equation (7) into equation (8) and taking 
the partial derivative give 

^? = Z <U-sin [2xf e (t - nT - NT ) + d]a(t - nT - NT ) 

dd n--N 

+ cos [27r/ e (/ - nT - NT ) + 6]d(l - nT - NT )). (10) 
From equations (8) and (7), 

*(0 = Z e n [3C{cos [2irj c {t - nT - NT ) + d]a(t - nT - NT )} 



+ 0C{sin [2tW - nTo - NT ) + d]d(t - nT - NT )}). (11) 

In single-sideband modulation, we have either inequality (2) or 
inequality (3). Let us consider inequality (2) first. When inequality 
(2) holds, the frequency spectrum of a(t), A (J), does not overlap the 
spectra of cos 2tjJ and sin 2irf c t. Furthermore, A (J) occupies a higher 
frequency band; therefore, equation (11) becomes 

3C[cos (2rf e t + e)a(f)} = cos {2nrj r t + d)d(t), 
and 

X[sin (2tt/ c / + 6)d(t)] = -sin (2irlJ + 6)a(t), ] e ^ /, . 

Substituting the above into equation (11) gives 
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Ht) = L «n {cos [2t/,(/ - nT - NT ) + 6]d{t - nT - NT ) 

n--A' 

- sin [2TJ c (t - nT - NT n ) + 6}a(t - nT - NT )}, 

fc^U. (12) 
Comparing equation (12) with equation (10) shows that 

m = ^ , h ^ u . (is) 

Substituting equation (13) into equation (9) gives 



ago 

36 



f 2[s(/) - q(t)}s(t) dt, UlkU. (14) 



Since a function and its Hilbert transform are orthogonal, equation 
(14) reduces to 



dd 



? = -2 £ q(t)s(t) dl, f.Zfi. (15) 



Thus, d& /dd can be generated by correlating q(t) with §(t). Note 
that the transversal equalizer output will be s(t) instead of s(t) if 
the demodulating carrier cos (2irj e t + 0) is replaced by cos [2irj K t + 
6 + ir/2]. However, even though s(t) can be generated in this simple 
fashion, we prefer not to generate s(t) because the system must be 
used instead to generate s(t) to compute the other partial derivatives 
dS /de„ . Therefore, we convert equation (15) into the form 



as 

dd 



f = 2 f_ m<t) dt, j c ^u. (i6) 



This step can be verified by Parseval's theorem. Now we need only to 
correlate s(t) with an easily generated q(t) to obtain dS o /d0. 

The above is for the case /, ^ f l . In the other case, / e ^ / 2 , the 
frequency spectrum of a(t) occupies a frequency band lower than f c ; 
therefore, the two equations above equation (12) should be rewritten as 

0C[cos (2w1J + d)a(t)] = sin (2*1 J -f 6)a(t), 
3C[sin (2irj c t + d)d(t)] = -cos (2irf c t + 6)d{t), j e ^ f 2 . 
Repeating the steps from equation (12) to equation (16), we get 

^= -2J\(t)q(t)dt, f.^U. (17) 
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Note from equations (17) and (16) that the sign of the correlator output 
must be reversed when one shifts the carrier frequency from one side 
of A (/) to the other side. 

About the equalizer tap gains, it is seen from equations (6) and (8) 
that 

ds, r or , A /A1 <3s(o 



*- f_J[s(t)-q(t)) d j&dt, 



de„ 

= p 2[s(t) - q(t)]y(t - nT - NT ) dt, n = -N to N. 

(18) 

Thus, d& /de n can be generated by correlating the error signal [s(t) — 
q(t)] with the output y(t — nT — NT ) of the nth tap. This is the 
concept introduced in Ref. 3 where the problem of setting the tap 
gains {e,,} was considered. 

IV. RECEIVER STRUCTURE 

It has been shown in the previous section that dS o /d0 can be easily 
generated simultaneously with d£ /de n , n = — N to N. Therefore, 
the method of steepest descent can be used to adjust simultaneously 
6 and \e n \. In the training period prior to data transmission, isolated 
test pulses are transmitted. For instance, 5(0 in Fig. 1 may be one of 
the test pulses. The transmission of 8(t) generates a signal s(t) at the 
equalizer output. From s(t) the partial derivatives d& /dd and dS /de„ , 
n = —NtoN, are computed. The parameters and \e„] are then 
changed by amounts proportional to the partial derivatives, that is, 

(One. = (Oca - /s|^ , n = -N to N, 

where a and /3 are positive proportional constants which may vary 
from one adjustment to another (the choice of their values will be 
considered in Section VI). After the changes are all made, another 
test pulse is transmitted and the process is repeated. The process is 
terminated after a prefixed number of test pulses. 

The receiver structure is shown in Fig. 2. The partial derivatives 
d& /dd and dS /de„ are computed according to equations (17) and (18), 
respectively (/ c ^ / 2 is assumed). If f c ^ /i , the correlator output in 
Fig. 2 will be -dS, /dd. 



EQUALIZATION AND CARRIER ACQUISITION 

y(t-nT -NT ) 



l()7f) 




PHASE SHIFTER 



Fig. 2 — Block diagram of the joint method. (C in the block denotes correlator.) 



V. FURTHER ANALYSIS 

We now analyze the system to answer the questions: (i) Will the 
mean-square error converge to the absolute minimum by the proposed 
adjustments? (ii) Is the convergence ensured regardless of how the 
parameters 6 and \e n ) are set prior to the starting of the process? (iii) 
What is the minimum mean-square error that can be obtained by the 
proposed method? To answer these questions, it is necessary to locate 
the stationary points of 8 , to distinguish between various types of 
stationary points, to determine conditions under which 8 will converge 
to a global minimum, to derive explicit solutions of 8 when the param- 
eters are set jointly or independently, and to obtain numerical data 
by simulating a real typical channel. In this section, we determine the 
location of the stationary points and distinguish between various types 
of stationary points. 

It is noted from equations (16) and (17) that d£ o /d0 changes sign 
when the carrier frequency /«. is shifted from one side of A (/) (that is, 
U ^ /i) to the other side (/ c ^ / 2 ) . To avoid this complication, a quantity 
p defined by 



p = 6, when j c ^ /, , 
= —d, when fc^fi, 



(19) 
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will be used instead of 6 in the following. By this change, the results 
obtained in the sequel will hold for both f e ^ ji and / c ^ / 2 . Hence, 
the position of f e will not be identified further. 

Since it is more convenient to use matrix notations in the following, 
time samples will be used. Let 



s k = s(kT + NT ), 
q k = q(kT + NT ), 
y k = y(-kT ). 
Then equation (6) becomes 



(20) 



where 



So " 2/o S 



8 - £ («. - q k ) 2 - 



(21) 



Since / is a constant, minimizing £ minimizes £ . 

It can be easily shown that equation (21) can be written in the 
following matrix form 



8 = E'yE - 2E'v + £ 



Qk 



(22) 



where the prime denotes transpose, the vector E and v and the matrix 
y are defined by 



E = 



j e_iv 



_ e N J 



v = 



V-ff+l 

L v N J 



(23) 



y = 



V-N.-N ' ' ' V-N.N 

Lyif.-N ••• Vn,nA 

CO 

Vh.i = 2 7a-*7.-* , 



V n = HI QkJn-k 



(24) 

(25) 
(26) 
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Clearly, E and v are (2N -f- 1) X 1 vectors, and y is a (2N + 1) X 
(2N + 1) matrix. One can easily show that y is nonsingular. 

We now evaluate the partial derivatives d£/dp and dZ/de n , n = —N 
to N, to determine the optimum parameter settings. From equation (22), 

To evaluate the terms in equation (27), we note from sampling theorem 
that equation (25) can be written as 

V*.i = 2/ f y(t)y[t - (h - i)T ] dt. (28) 

By Parseval's theorem, equation (28) becomes 

Vh - 4 = 2/o /_„ I T(f) |2 exp [-#»*# ~ t)T ) df. (29) 

For single-sideband systems, the demodulating carrier phase p appears 
only in the phase characteristic of T(f), while the amplitude char- 
acteristic | T(/) | of r(/) is independent of p. Therefore, from equation 
(29), y hii is independent of p and 

^ E 'y E = o- (so) 

This greatly simplifies the results. (It is important to note that this 
simplification is not possible for double-sideband and vestigial-sideband 
systems because p appears in \T(f)\ in such systems.) Substituting 
equation (30) into equation (27) gives 

^=- 2 £ E '- (3D 

To evaluate equation (31), we convert the elements v n of v into explicit 
functions of p. For single-sideband systems, r(/) can be decomposed 
into the form 

r(/) = //(/) exp (-jp), / ^ 0; 

{62) 
= H(f) exp (j p ), /^0; 

where H(f) is independent of p.* From equations (26), (20), (4), and 

* H(f) \a the Fourier transform of y(t) when p = 0. The amplitude and phase of 
H(f) will be denoted by \H(f)\ and v (f), respectively, that is, H(f) = \H(f)\ exp 
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(32) , it can be shown that 

v n = Mn exp (— jp) + v n exp (jp), n = —N to N (33) 

where 



and 



M„ - E q* f° H(D exp [j2rf(k - n)T ] df 
Vn = Z <ft f # (/) exp [i2r/(fc - n)T ] df. 

A.-=-oo •'-/o 



(34) 



(35) 



Note that y. n and ^„ are independent of p. From equation (33), we can 
evaluate the term d/dpE'v in equation (31) to obtain 



as 

dp 



= 2;[exp (-;»E / v - exp 0»E'v] 



(36) 



where 



p — 1 




r- _ 


P-N 




V-N 


M--V+1 


v = 


V-N+l 


_ Mjv - 




_ Vn - 



(37) 



The above is for d8/dp. The other partial derivatives can be obtained 
from equation (22) as 



dE 



= 2yE - 2v. 



(38) 



A necessary condition for a specific p and E to be jointly optimum 
(that is, to jointly minimize S) is that they satisfy 



d& 
dp 



= 



and 



dE 



(39) 



(40) 



There are special cases where the optimum setting of p is arbitrary 
(for instance, if an infinite length tapped delay line is used, the taps 
can always be adjusted to reduce the mean-square error 8 to zero 
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regardless of how p is set). We shall not consider such special cases here. 
This implies, as can be shown from equations (36), (38), (39), and 
(40), that the special cases where y'y _I y is zero are disregarded. 
From equation (38), equation (40) can be written as 

E = y-'v. (41) 

From equations (36) and (41), equation (39) can be written as 

exp (-jpVy'V - exp G'p)v / y~ 1 v = 0. (42) 

One can show from equation (33) that equation (42) is equivalent to 

^f^ = exp(i4 P ). (43) 

It can be shown from equations (34) and (35) that 

Re [y'y-V] = Re [v'^'v], 

and 

Im [v'y -1 y] = -Im [v'y-'v], (44) 

where Re and Im denote real and imaginary parts of the complex 
number, respectively. From equation (44), one can show that equation 
(43) is satisfied if and only if 

7T , .,Im [u'y _1 u] 
p = m - + itan Rc fry-fr («> 

where m can be any integer including zero. Substituting equation (45) 
into equation (33) and substituting the resulting equation into equation 
(41) give 

E = exp[4^ + i tan--^^)]y-V 

+ e *p[^f + *tan--£^)] y -V (46) 

The value of p and E which satisfies the necessary conditions (39) 
and (40) is given by equations (45) and (46), respectively. It is clearly 
seen from equations (45) and (46) that, since m can be any integer, 
there is more than one set of solutions of p and E from conditions (39) 
and (40). In the following, we determine which of these solutions 
actually minimizes S . 

In conventional terminology,' 1 each solution of conditions (39) and 
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(40) is a stationary point of S . We shall determine which of the sta- 
tionary points is a global minimum. 

We first identify the minima, maxima, and saddlepoints. As is well 
known, 4 a sufficient condition for a stationary point to be a local mini- 
mum is that the quadratic form 

F = h'Qh (47) 

be positive-definite at that stationary point, where Q, known as the 
Hessian matrix, is 



Q - 



a 2 So 
a 2 So 

de-N dp deL; 



rso 


dp de-y 


^ 2 So 



<rSo 

dp de N 
d 2 £ 



de-N de* 



d 2 S n 



d 2 S 



and 



_ de N dp de N de_, v 

h' = [hi h 2 • • ■ /i(2iv+2>] 



a 2 S 
de 



■>n _ 



A stationary point is a local maximum if F is negative-definite at that 
point. At a saddlepoint, F is indefinite. To evaluate F, note from 
equations (16), (17), and (19) that 



^ = 2 f q(t)s(t) dt 
dp J-* 



and 



ds(t) 
dp 



= m- 



From equations (48), (49), (8), and (18), we get 

als 

dp 

d 2 8 

dp de 

and 



(48) 
(49) 

(50) 



= 2 [ q(t)s(t) dt; 

J— co 

<L = 2 f y(t - nT - NT )q(t) dt, n = -N to N; (51) 

6„ J-m 



_d 

de 



?!|fi- = 2 f y(t - iT - NT )y(t - jT Q - NT ) dt, 
i de,- •/-«, 

i, j = -N to iV. (52) 
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Substituting equations (50), (51), and (52) into equation (47) and 
rearranging, we obtain after some steps 

F = 2 f_ |m(/) + 2 h 2+n+N y{t - nT - NT )j dt 
+ 7h\ [ q(t)s(t) dt 

J -to 

- 2h\ f q\t) dt. (53) 

Now we may substitute equations (45) and (46) into equation (53) 
and evaluate the resulting expression over all possible hi to h (2 N+2) 
to determine whether F is positive-definite, negative-definite, or in- 
definite at a given stationary point. This determines if that point is 
a local minimum, a local maximum, or a saddlepoint. While these steps 
are important in the analysis, they are rather complex and are therefore 
given in Appendix A. It is also shown in that appendix that all the local 
minima are equal and hence all are global minima. The results in 
Appendix A are summarized in the following proposition. 

Pro-position 1. 8 has a global minimum when p and E are given, re- 
spectively, by equations (45) and (46), with m in these equation being 
an even integer. S has a saddlepoint when p and E are given, respec- 
tively, by equations (45) and (46), with m in these equations being 
an odd integer. 

It is seen from this proposition that there is an infinite number of 
global minima, each one corresponding to an even integer m . The 
distance in p between two adjacent global minima is therefore t. There 
is also an infinite number of saddlepoints, each one corresponding to 
an odd integer m . The distance in p between two adjacent saddlepoints 
is also 7r. The distance in p between a saddlepoint and its adjacent 
global minimum is ir/2. It is instructive to illustrate these with an 
example and a figure. Consider a transversal equalizer with only one 
tap e (such a single tap serves as an automatic gain control and 
the problem is to jointly set the automatic gain control and carrier 
phase to minimize the mean-square error). For simplicity, suppose 
that the term tan -1 [Im (y'y _I y)]/[Re (y'y _I y)] in equation (45) turned 
out to be zero. Then from the proposition S has global minima at 
p = 0, ±x ±2ir, • • • , and S has suddlepoints at p = ±ir/2, ±37r/2, 
The global minima at p = and w are illustrated by points 1 and 3 
in Fig. 3 and the saddlepoint at p = x/2 is illustrated by point 2. 
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So 




Fig. 3 — An example illustrating the global minima and saddlepointe of So. 



The curved surface in the figure illustrates the variation of 8 with p 
and e . For instance, when p is fixed at ir, 8 varies with e as shown by 
the convex curve passing through points 4, 3, and 5. (For fixed p, 8 
is a convex function of the tap gains e- N to e N .) As can be seen, point 
2 is a saddlepoint because varying e away from point 2 with p constant 
increases £ , while varying p away from point 2 with e constant 
decreases 8 . 

To summarize, in this section we have located the global minima 
and saddlepoints and proved that there is no local minimum or maxi- 
mum. There is also no valley 5 since the global minima are all distinct. 
These results will be used in the next section for the study of con- 
vergence and in Sections VII and VIII for the computation and com- 
parison of performances. 

VI. A CONDITION OF CONVERGENCE 

As described in Section IV, after a test pulse is transmitted and the 
partial derivatives are computed, the carrier phase and the tap gains 
are adjusted according to the equation 
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Anew - 0old-«|f , (54) 

(e„) ne w = (O.id-0^, (55) 

where a and /3 are positive proportional constants which may vary 
from one adjustment to another. As is well known, 5 the process always 
converges when a and /3 are sufficiently small, but may not converge if 
a and /3 are too large. In this section, we derive a condition on a and 
which ensures the convergence of the process. 

To facilitate analysis, we shall, as in Section V, replace 8 with p, 
S with 8, the tap gains e~ N to e N with the vector E, and the partial 
derivatives dS/de.* to dZ/de N with the vector 58/dE. The adjustment 
made after the fcth test pulse will be called the fcth adjustment (k = 
1, 2, • • •)• The values of p, E, 8, dS/dp, and <98/dE prior to the /hth 
adjustment will be denoted by p k , E* , &(p k , E k ), (d/dp)&(p k , E k ) and 
(d/dE)8(p fc , E k ), respectively. The a and /3 used in the Mi adjustment 
will be called a k and k , respectively (note that a k > and /3* > 
for all A; so that the adjustments will be made in the negative gradient 
direction). The values of p, E, 8, dS/dp, and dS/dE after the fcth adjust- 
ment will be denoted by Pk+1 , E,. +1 , &(p k+l , E A+1 ), (d/dp) &(p k+1 , E A+1 ), 
and (d/dE) 8>(p k+1 , E ft+1 ), respectively. 

With the above notations, equations (53) and (54) can be written as 

Pa+i = Pk — oi k — S(p i , E k ), (56) 

E t+1 -1,-fc^ S( P4 ,E 4 ). (57) 

The decrease in mean-square error due to the kth. adjustment is denoted 
by A8jt , that is, 

A8 t - 8(p* , E fc ) - &(p k+1 , E k+1 ). (58) 

Clearly AS* approaches zero when the partial derivatives approach 
zero. A stronger statement that may sometimes hold is that "AS* 
approaches zero only when the partial derivatives approach zero." 
By this statement is meant that for every e > 0, we can find a S > 
such that 

I A8 t | ^ 5 (59) 

if 
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[l 8 <» • «]' + [k £ <» • *>]'|jB e( " ■ E *>] 



> 



(60) 



We shall say that 8 converges to a stationary point if for every e > 
there is an N such that 



[^S(p,,E ( )] 2 + [^SU,E 1 .)]'[^ E (p 1 -,E 1 ) 

for all k > N. 

The following lemma is needed in our discussion. 



< e (61) 



Lemma. If AS fc > for all k and AS k approaches zero only when the 
partial derivatives approach zero, S must converge to a stationary point. 

Proof: The proof of this lemma is simple. Assume that S does not 
converge to a stationary point, that is, for an e > it is not possible 
to find an A'' such that equation (61) holds for all k > N. Then equation 
(59) must hold for an infinite number of k's. From this and the assump- 
tions in the lemma, we see that AS* ^ 8 > for infinite number of k's. 
This implies that S reduces without bound as k increases, contradicting 
the fact that the mean-square error cannot be less than zero. Hence 
the lemma holds. 

The lemma provides a means of determining a k and & . According 
to the lemma, £ will converge to a stationary point if we can determine 
a k and & such that AS* > for all k and AS* approaches zero only 
when the partial derivatives approach zero. Theoretically, such a k 
and k can be determined using known mathematical techniques. 8 
But unfortunately these techniques were developed for use with com- 
puters and are too complicated to be used in a receiver (for a discussion, 
see Appendix B). In the following proposition, it is shown that a h 
and /?* can be determined rather easily if E and p are set in some alternate 
fashion. 

Proposition 2. Let the set of positive integers be divided arbitrarily 
into two disjoint subsets 3Ci and 3C 2 , each containing an infinite num- 
ber of positive integers. Let a k = when k t 3Ci , and /3* = when 
k e 3C 2 . Let Xi denote the maximum eigenvalue of y (Xi > since y 
is positive definite), and let [/"„ s 2 (t) dt] Ek denote the value of Jf M s 2 (t) dt 
when E = E fc . The mean-square error S will converge to a stationary 
point if 

< ft < f (62) 
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for k t 3C, , and 

2* 

< «* < — r^ T^ « (63) 



2/u [i1 s2(0 (U \ t + 2/o /" r(0 dt 



for /c c 0C 2 . 

The proof of this proposition is complex and is given in Appendix C. 
The proposition states that p and E may be adjusted in any alternating 
fashion* and the mean-square error 8 will converge to a stationary 
point if /3* satisfies equation (62) during the adjustment of E, and 
a k satisfies equation (63) during the adjustment of p. The term Xi in 
equation (62) and the terms [/"«, s 2 (t) dt] Ek and /"„ q 2 (t) dt in equation 
(63) can be estimated or measured. Consider first X x , the maximum 
eigenvalue of y. There are various methods of estimating the maximum 
eigenvalue of a matrix. 6 ' 7 For example, it is possible to estimate Xi 
from amplitude characteristics of the transmission medium. [Note 
from Fig. 1 that the amplitude and phase characteristics of the trans- 
mission medium are denoted by | W(f) | and <o(/), respectively.] It is 
seen from equation (29) that the elements y hii of y can be computed 
from | T(/) |. For single-sideband systems, | T(f) | depends on | W(j) |, 
but not on w(/) and the demodulating carrier phase p. Thus, if statistics 
of | W{j) | are available (for example, in a voiceband system), y hfi 
can be computed from equation (29) and ^ can be estimated. The 
maximum possible value of X x then can be used instead of \ x in equation 
(62). 

The factor /"„ q'(t)dt in equation (63) is known because the desired 
signal q(t) is given. The other factor [f™„ s 2 (t)dt] Ek is simply the energy 
of the signal s(l) prior to the feth adjustment, and can be easily measured 
at the equalizer output. 

Summarizing the above, a condition of convergence has been described 
in the lemma. Based on the lemma, a specific condition of convergence 
has been obtained in Proposition 2. The upper bound in equation (62) 
can be estimated from a priori channel statistics and the upper bound 
in equation (63) can be easily determined prior to each adjustment. 
Thus, a k and /3 fc can be set accordingly prior to the adjustments to 
ensure that 8 will converge to a stationary point. 

It has been shown in the previous section that a stationary point 
must be a global minimum or a saddlepoint. Thus, 8 may converge 



f For example, one may fix p and adjust E until 3S/3E approaches zero, then fix E 
and adjust p until BL/dp approaches zero, and repeat the cycle until both 38,/dE and 
dZ/dp approach zero. 
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to a saddlepoint instead of a global minimum. Fortunately, such a 
possibility is remote. A great advantage of gradient methods is that 
they will inherently stay away from saddlepoints. 5 It has been found 
by researchers that gradient search computer program avoids saddle- 
points so dependably that the only way they could test their program 
for exploring the neighborhood of a pass was to start the search there. 
Wilde and Beightler suggested the reason by sketching a bi modal 
surface to show that only one gradient line out of the infinite number 
possible actually passes through the saddlepoint. 5 The other gradient 
lines all lead directly to a minimum or a maximum. Hence the possibility 
of converging to a saddlepoint is remote. 

It has been shown in the previous section that the distance in p 
between a saddlepoint and its adjacent global minima is x/2. From 
this, several tests can be devised for distinguishing between global 
minima and saddlepoints. The following one is particularly simple. 
At the design stage of the system, one may compute the value of 8 
at global minima and saddlepoints (the detailed steps in Section VII 
may be used. The value of 8 at global minima is equal to S min in equation 
(64), while the value of 8 at saddlepoints is equal to S ind evaluated at 
Ap = tt/2.) As illustrated in Section 8.1 and Fig. 4, the value of 8 at 
saddlepoints can be many times larger than that at global minima. 
Consequently, a threshold can be set up such that when 8 converges 
to a value above the threshold it may be concluded that a saddlepoint 
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is reached. Then p will be shifted by r/2 and E readjusted. If 8 con- 
verges to a value below the threshold, the process is terminated. This 
and other possible tests all require additional circuitry. Since the 
possibility of converging to a saddlepoint is remote, one should decide 
from actual trials whether such a test should be employed. 

VII. PERFORMANCE COMPARISON 

In the previous sections, we have considered jointly setting p and E 
for minimizing the mean-square error 8. It has been shown in Prop- 
osition 1 that the joint optimum settings of p and E are given, re- 
spectively, by equation (45) and (46) (with m in these equations being 
an even integer). Substituting these optimum settings into equation 
(22), we obtain the minimum mean-square error 

e min = Z ql - 2 | y'r'v I - Sy'y-V (64) 

It has been shown that this 8 mio can be obtained by setting p and E 
jointly. Now we wish to compare 8 min with what can be obtained by 
another scheme. In single-sideband systems, it is possible to transmit 
a carrier pilot to the receiver to demodulate the received signal. The 
demodulating carrier phase therefore is 

P = Pc + Pf, 

where p e is the phase of the received carrier pilot and p f is a fixed phase 
shift that is sometimes introduced for signal shaping. With p fixed at 
p c + p f , E can be adjusted to minimize 8. The value of 8 thus obtained 
will be denoted by 8 ind , where the subscript "ind" indicates that p 
and E are set independently. We now compare E ind with E min . 

To determine E ind , let the difference between p c + p, and the optimum 
setting of p be denoted by Ap. From equation (45), 

*P = P. + »-[-f + itaa-£j^] (65) 

where m can be any even integer (including zero). Since E is set to 
minimize 8 after p is set to p^ + p f , E is equal to y _1 v evaluated at 
p = p e + p f , that is, 

E = y~\ c (66) 

where v c is v evaluated at p = p e + p r . Substituting equation (66) 
into equation (22) gives 
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S ind = t ql- Ve'y-V, . (67) 

It can easily be shown from equation (33) that 

v;y _1 v c = exp [-j2(j> e + p / )]y / y"V + 2y'y~ 1 v 

+ exp [j2( Pe + P/ )]v'y" 1 v. (68) 

Substituting equation (65) into equation (68), and substituting the 
resulting equation into equation (67), one can obtain after some manip- 
ulations that 



Sind =E?;- 2(cos 2 Ap) | v'y-V I - Vy~V (69) 

fc=-M 

From equations (64) and (69), the difference between 8 ind and 8 min is 

Sind - S min = 2[1 - cos 2A P ] | v'y-y |. (70) 

Note that y' and y" 1 are independent of Ap. Thus, the only term in 
equation (70) that depends on Ap is cos 2Ap. Now we can make the 
following observations. 

First, S ind — S min is nonnegative, meaning that independently 
setting p and E increases the mean-square error. Second, S ind — S min 
varies periodically with Ap with a period of tt. Third, because of the 
nature of cosine, 8 ind — S min is small when Ap is small, but increases 
rapidly when Ap increases (note the factor two in cos 2Ap). 

For a given system, one may compute 8 ind , S min , and their dif- 
ference from equations (64) and (69). The computation can best be 
carried out by a computer program in the following steps (see also the 
example in the next section). First, specify f , N, and the desired 
signal q(t). Determine the time samples \q k } of q(t). Second, determine 
H(f) from transfer functions of the transmitting filter, the transmission 
medium, and the receiving filter. Third, compute the elements y hii 
of y from the equation 



Vh 



., = 4/o f | H(f) | 2 cos [2vf(h - i)T ] df. 

Jo 



(71) 



Compute the elements a hti of y x from y h<i . Fourth, compute the terms 
| y'y'y | and v'y _1 v in equations (64) and (69) using the explicit equa- 
tions 

I v'y-y I - Pi - B 2 ) 2 + 4B 2 ]*, (72; 

y'y-'v = B, + B 2 , (73) 
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where 

B,= E t a h .Mi , (74) 

A = _AT i--N 

B 2 = £ £ a hli Ui , (75) 

h--N i--N 

B 3 = £, £, °>nMt< , (76) 

h--N i = -K 
4--00 •'0 

r.- = Z ?* f ° I #(/) I sin h(0 + 2tt/(A;To - tT„)] d/. (78) 

*--oo Jo 

Finally, compute 8 min from equation (64) and 8 ind from equation (69). 

VIII. COMPUTER SIMULATIONS 

We now apply the previous results to a single-sideband partial- 
response voiceband data communication system using a five- or nine- 
tap transversal equalizer. Since the results are largely similar, we shall 
describe only the five-tap case. 

Two computer programs have been written for this system. In the 
first program, £ min and 8 ind are computed using the formulas in the 
previous section. In the second one, 8 min and S lnd are obtained using 
the method of steepest descent. The results of these two programs 
are described separately in the following subsections. 

8.1 Comparison o/ 8 min and £ ind Using the Explicit Formulas in Section 
VII 
Consider the voiceband data communication system described in 
Ref. 8. The desired signal q(t) of such a system is the class IV partial- 
response signal, that is, 

2>7T sin 2rUt - Q (7q) 

where a time delay l is included to take into account the time delay 
in the channel. From Ref. 8, 

/o = 1200 Hz. (80) 

Hence, T Q = l/2/ = 1/2400 seconds. We shall consider a transversal 
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equalizer with five taps, that is, N = 2. It has been defined that H(f) 
is the Fourier transform of y(t) when p = 0. The amplitude and phase 
of H(j) have been denoted by | H(f) | and tj(/), respectively, that is, 

H(f) = | H(f) | exp [j v (f)]. (81) 

It is assumed that the amplitude and phase distortions in the filters 
are negligible compared with those in the transmission medium. Con- 
sequently, | H (/) | and t)(j) can be determined from amplitude and 
phase characteristics of the transmission medium (which is a voice 
channel in this case). From Ref. 10 and Fig. 4 of Ref. 8, a typical 
| H(j) | is found to be 

| H(f) | = [sin 27rr /]10 - 3ro{/o - 2/) , ^ / ^ /o . (82) 

The component sin 2irT f in equation (82) represents the desired am- 
plitude characteristic of the class IV partial-response signal, while the 
term iQ°- 3T ° if °- 2n represents typical amplitude distortion in a voice 
channel. The delay characteristic of five links of K carrier shown in 
Fig. 24 of Ref. 11 is representative of the delay characteristic of a voice 
channel. Therefore, it will be used here and 

v (f) = 9.89 sin [2tt(/ + / ) -0.00019 - 2.203], 5 / £ / . (83) 

Constant phases and time delays in the transmission medium and the 
filters are omitted in writing ij(f) because their values are not available 
and because they only change the phase and time origins in the com- 
putations. However, such an omission makes it impossible to determine 
the phase p e of the received carrier pilot because the carrier pilot does 
not travel the same path as the signal. The signal is transmitted through 
the transmitting and receiving filters, while the carrier pilot is trans- 
mitted outside of these filters (these filters theoretically should have 
infinite attenuations at the carrier frequency). Furthermore, the carrier 
pilot is recovered at the receiver through a separate narrowband filter 
or a phase-lock loop, while the signal is demodulated and passes through 
a low-pass filter. With these differences and without detailed phase 
characteristics of the filters, it is not possible to determine p„ here. 
Therefore, we shall simply leave p e and Ap as variables and compute 
the variation of S ind with Ap. (It should be noted that Ap may assume 
small values in some real systems.) 

The variation of 8 ind with Ap has been determined for various values 
of t • The curve obtained at t = —2T is typical and is presented 
in Fig. 4. The value of 8 min is also indicated in the figure. It can be 
seen that 8 ind can be as large as 0.034, while £ min is only 0.0014. Thus, 
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the mean-square error can increase 0.034/0.0014 = 24.3 times if the 
demodulating carrier phase p is not set properly. Note that this large 
increase is obtained for a five-tap transversal equalizer. A similar 
result has been obtained for a nine-tap transversal equalizer. These 
results demonstrate that the mean-square error depends critically on 
the carrier phase setting when the number of equalizer taps is not large. 
(It should be noted, however, that when a large number of taps are 
used, the mean-square error will not be sensitive to the carrier phase 
setting.) 

8.2 Mean- Square Errors Obtained by Method of Steepest Descent 

A computer program has been written to simulate the sj'stem de- 
scribed in Section 8.1. The receiver parameters are adjusted by the 
method of steepest descent described in Section IV. Five hundred test 
pulses are used in each training period. The initial setting of the center 
tap e is unity, while the initial settings of the other taps are zero. The 
parameter t is fixed at — 2T as mentioned in Section 8.1. The mean- 
square errors obtained with various initial settings of p are shown in 
Fig. 5. 

The points marked by "X" in Fig. 5 are obtained by adjusting p 
and E jointly using a k = 0.523 and /3 A = 0.1 for all k. For instance, 
point A is obtained by initially setting p to 40 degrees above the optimum 
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p and then adjusting p and E jointly by the method of steepest descent. 
It can be seen from these points that the mean-square error 8 always 
converges to the value of 8 min determined in Section 8.1. This demon- 
strates the fact that 8 min can be obtained by jointly setting p and E. 

The other points marked by circles are obtained by fixing p and ad- 
justing E only (a k = and ft = 0.1 for all k). For instance, point B 
is obtained by initially setting p to 40 degrees above the optimum p and 
then adjusting E only by the method of steepest descent. It can be seen, 
by comparing these circled points with the value of 8 ind in Fig. 4, that 
the mean-square error 8 converges, as expected, to 8 ind . 

Only 500 test pulses are used in each training period because 8 
converges within this period in all cases. For instance, the convergence 
of 8 to the values shown by points A and B in Fig. 5 are illustrated, 
respectively, by curves A and B in Fig. 6. It can be seen that 8 converges 
rapidly to the final values. 

IX. SUMMARY AND CONCLUSIONS 

It is shown in this paper that in single-sideband systems the trans- 
versal equalizer and the carrier phase can be set jointly by the method 
of steepest descent to minimize the mean-square equalization error. 

The system is analyzed and a receiver structure is developed. The 
receiver structure is theoretically as simple as a conventional one. 
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Fig. 6 — An illustration of convergence. 
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A well-known problem of the method of steepest descent is that the 
function to be minimized may converge to a local minimum instead 
of to a global minimum. To prove that this troublesome problem does 
not arise here, the variation of the mean-square error in the parameter 
space is analyzed. Exact locations of the stationary points in the param- 
eter space are determined and the classifications of the stationary points 
are obtained. It is shown that the mean-square error has only global 
minima and saddlepoints, but not local minimum or maximum. Thus, 
it is not possible for the adjustments to converge to a local minimum, 
regardless of the initial parameter settings. This completely eliminates 
the problem. 

It is also shown that two adjacent global minima (or two adjacent 
saddlepoints) are separated by 180 degrees in demodulating carrier 
phase, while a global minimum is separated from its adjacent saddle- 
points by 90 degrees in demodulating carrier phase. From this, a test 
is described to determine whether a global minimum or a saddlepoint is 
reached by the adjustments and to correct the settings if a saddlepoint 
is reached. This test may not be necessary because both previous ex- 
perience and theoretical considerations have shown that the method of 
steepest descent inherently stays away from saddlepoints. 

The choice of the step sizes of the adjustments is also considered. 
There are methods for determining the step sizes; however, they require 
complicated computations. As an alternative, it is shown in this paper 
that the step sizes can be easily determined if the equalizer and the 
demodulating carrier phase are adjusted in different steps (the steps 
can be alternated in any manner). 

Closed-form expressions of the joint optimum parameter settings 
and of the corresponding minimum mean-square error are obtained for 
computation of system performance. For illustration purposes, a single- 
sideband data communication system using a five- or nine-tap trans- 
versal equalizer is simulated on a computer. Both theoretical and simu- 
lation results show that the equalization error can increase by ten times 
or more when the carrier phase is not properly set. These demonstrate 
that when the number of equalizer taps is not large, the equalization 
error depends critically on the carrier phase setting. The computer 
simulation also verifies the theory that the equalization error can be 
minimized by using the joint method described in this paper. 
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APPENDIX A 

Classification of Stationary Points 

In this appendix, we prove the statements that: 

(i) F is positive definite when p and E are given, respectively, by 
equations (45) and (46) , and m in these equations is an even integer; 
(ii) F is indefinite when p and E are given, respectively, by equations 
(45) and (46), and m in these equations is an odd integer; and 
(Hi) All the local minima of 8 are equal. 

Consider the first statement. Let ft be the set of all h except h = 0. 
Let ft be decomposed into two disjoint sets ft, and ft 2 , where ftx contains 
the h's with h t 5^ 0, and ft 2 contains the h's with ^ = 0. Consider first 
fti . Since h t =^ in Q t , equation (53) can be written as 

F = 2h\ f° [a(t) - q(t)f dt + 2h\ f q(t)s(t) dt - 2h\ f q\t) dt 

J-00 J— tO «'-«> 

(84) 
where 

*(i) = Z em - nT - NT ) (85) 

and 

e / = h*2±*L n = -N to N. (86) 

For the sake of brevity, we shall denote the entire right side of equa- 
tion (45) by p and the entire right side of equation (46) by E . For 
any function G, the symbol [G] fa denotes the value of G evaluated at 
p = p , the symbol [G] Po , Eo denotes the value of G evaluated at p = p 
and E = E , and the symbol Min G denotes the minimum value of 
G in fi, . These symbols and notations may be used jointly. For instance, 
Min [(?]„„, e. denotes the minimum value of [G] Po , E „ in Qi . 

Since h t ?* in Q t , [F] POiEo > in 0, if and only if [F/2^] Po . Eo > 
in ft, , or if and only if Min [i r /2^]p..Eo > 0. From equation (84), 



Min 
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MinTf KO - q(i)f dl\ 

_«'-00 J|:.. 

" ?(0s(0 d/ 

L j -O0 Jpo.Eo 



+ 



(87) 



In writing the above equation, we have used the fact that a(t) is inde- 
pendent of E, s(t) is independent of h, and q(t) is independent of p, E, 
and h. 

Consider the first term on the right side of equation (87). From equa- 
tion (85), a(t) is a function of y(t), where ~ indicates Hilbert transform. 
It can be shown from equations (7) and (19) that 

[?(*)],. = [t(0Lw2 . (88) 

Now we have 



M 



- [/: 



[a(l) - q(t)] 2 dt 



1 



- M 



in JT [ £ e!m - nT - NT )] P . - g(/)T dt, 



r <v 



■ Mill / I Y, <W - nT - NT )] Po+r/ 2 - g(oT dt, 

•'-co L" — v J 

= {Min r I £ e'Mt ~ nT - NT ) - q(t)\ dff 



(89) 



Note that equation (88) is used in the second step above. Since the 
term ^L-» e M* — nT — NT^) in the last expression above is similar 
to s(t) in equation (8), the whole term in the { j in the last expression 
above is equal to £ minimized with respect to E. This proves that 
Min [/"„ [a(t) — q(t)] 2 dt]„, is equal to S minimized with respect to E 
and evaluated at p = p + t/2. It can be easily shown from equations 
(21) and (22) that 8 minimized with respect to E is equal to 
1/2/oE?— c ql - v'y"v]; hence, 



M 



in [/" [a(i) - q(l)] 2 dtj 



\l>»+*/.l 



= Hi" Z qt - Tjr [v'y V] 

1 °° 1 

= W £ ql - 57" [2tf'y _1 v - cxp (-j2poVy"'y - exp 0'2p )v'y _1 v], 

-Jo ft — » -Jo 

where equation (33) has been used in the last step. 



(90) 
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We have evaluated the first term in equation (87). Consider now the 
second term in equation (87). From equations (20), (23), and (26), 

f q(t)s(t)dt = ~v'E. (91) 

J-oo &Jo 

Substituting equations (33), (45), and (46) into equation (91) and sim- 
plifying, we obtain, after some steps, 

I f" q(t)s(t) dt\ =j tf'y-'v + ^- exp (-jWvY'v 

LV-oo Jpo.Eo Jo *7o 

+ 57" exp (i2po)v'y-'v. (92) 

The last term in equation (87) is 

f q\D dt = ±± ql. (93) 

J-oo ^/O *--oo 

Substituting equations (90), (92), and (93) into (87) and canceling out 
the terms having opposite signs, we obtain 

Min MU = f exp (-fip^vY'u + r exp (i2p )v'y -1 v. 

L2/IiJ Po .e, 7o Jo 

Note that so far m can be any integer. Let | y'y~V I denote the magni- 
tude of y'y _1 y. It can be shown from equation (44) that when m„ is 
an even integer, the right side of equation (94) is equal to 2// | y'y'u |. 
Hence, when m is an even integer (including zero), 

Min [£l = T I B'y'lf I • (95) 



(94) 



Since | y'y'v I f* 0, equation (95) shows that Min [F/2^] p . . E . > 0. 
Thus, [F]„. . Eo > in Q, when m is an even integer. 

Now consider fi 2 . Since h t = in fi 2 , equation (53) is reduced to 

F - 2 f" | 2 h 2+n+N y(t ~ nT Q - NT ) ' d*. (96) 

J-oo Ln--AT 

Since t(« - nT - NT ), n = -N to N, are linearly independent and 
h a to /i 2 w+a cannot be all zero in ft 2 , the integrand in equation (96) 
cannot be zero for all t. Hence, F > in fi 2 . This implies, of course, 
[H..b. > 0inO a . 

We have shown that when m„ is an even integer, [F] Po . Ea > in ft x 
and ft 2 . Hence, when m is an even integer, [F] Po . Eo > for all h except 
h = 0. This proves the first statement at the beginning of this Appendix. 
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Now we prove the second statement at the beginning of this Appendix. 
Note that the derivations from equations (84) through (94) hold for 
all integer m Q . It can be shown from equation (44) that when ra is 
an odd integer, the right side of equation (94) is equal to — 2// | y'y'y |. 
Hence, when m is an odd integer, equation (94) becomes 



bhti 



Min | £s | = -j\ y'y-'tf ! • (97) 

. E„ JO 



Since | y'y'V I 9* 0, Min [F/2h]] p ., E . < 0. Thus, [F] Po . Eo can be negative 
in fix (and hence in 12). It can also be positive in 12 (for instance, [F] POtEo 
> in 12 2 ). Therefore, F is indefinite when p and E are given, respectively, 
by equations (45) and (46), and m in these equations is an odd integer. 
This proves the second statement. 

Finally, consider the third statement. We have shown that F is posi- 
tive definite and S n has a local minimum when p and E are given, re- 
spectively, by equations (45) and (46), and ra in these equations is 
an even integer. Substituting equations (45) and (46) into equation 
(22) and letting m be an even integer, we see that the local minimum 
of 8 is 

8o = ttt £ & - t I »'y- » I - T tf'y V 08) 

£]0 *--oo 10 JO 

Since the right side of equation (98) is independent of m , all the local 
minima of 8 are equal and hence are all global minima. This proves 
the third statement at the beginning of this Appendix. 



APPENDIX B 

Discussion of Existing Method 

A number of methods are described in Ref. 5 for determining the 
proportional constant in steepest descent adjustments. It has been 
pointed out that these methods can be used to solve certain nonlinear 
equations on computer. 12 

These methods require elaborate computations. For example, con- 
sider the possibility of determining a k and 0* using the theorem on page 
31 of Ref. 5. Since it is assumed in that theorem that a single propor- 
tional constant is used for all parameters, we change the scale of p 
or E such that it is also appropriate to use a single proportional constant 
in our case. After this is accomplished, we can use a single constant a k in 
equations (56) and (57). To determine a l: , define 
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g( Pk , E k , a k ) = -Tp- _,, A \ r =-r • (99) 



a k 



L 7 P «(*.»■)_ 



+ 



£[£«*•*>]'} 



Choose a constant 5 in the range < 5 ^ §. Compute gr(p* , E t , 1). 
If g( Pk , E k , 1) < 5, choose a k < 1 so that 5 ^ f/(p t , E k , a k ) ^ 1 — 5 
(it has been shown that this choice is always possible). If g(p k , E k , 1) 
^ 5, choose a k = 1. 

It can be seen from this description that the method requires elaborate 
computations and a k must be determined on a trial and error basis when 
g(p k , E k , 1) < 5. It is therefore difficult to use this method in a receiver. 

APPENDIX C 

Proof of Proposition 2 

In this Appendix, we prove Proposition 2. Consider first k e 5Ci . 
Since a k = when k c 3Ci , we have p k+1 = p k and 

A 8* = 8( Pt ,E t ) - &( Pk ,E* +1 ), 
= EfrE, - 2E^v + £) ql 

A-— oo 

- [E^ +1 yE t+1 - 2E£ +1 v + £ gjl • (100) 

From equations (57) and (38) 

E* + , = E t - fr(2yE t - 2v). (101) 

Substituting equation (101) into equation (100), we rearrange the 
resulting equation into the form 

AS, - 4&(yE t - v)'[I - fi k j](jE k - v) (102) 

where I is the identity matrix. Let the eigenvalues of y be denoted, in 
the order of decreasing magnitude, by A, , i = 1 to 2N + 1, so that 

Ai ^ A 2 ^ • • • ^ A 2JV+ i . (103) 

It can be shown that y is positive definite; hence, 

A, > 0, i = 1 to 2iV + 1. (104) 

Let Ui , i = 1 to 2N + 1, be a set of orthonormal eigenvectors of 
y. Let 

Q = [uiU 2 • • • Our+J. (105) 
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That is, the zth column of Q is u, . From well-known matrix properties, 

y = QDQ' (106) 

where D is a diagonal matrix whose ith diagonal element is X,- . Further- 
more, 

QQ' = I. (107) 

Substituting equations (106) and (107) into equation (102), one can 
write 

AS* - 4ft E (1 - (3*X.)[(yE* - v)'u,f. (108) 

If ft satisfies equation (62), we can write 

1 - ftX, ^ P, > 0. (109) 

Then, from equations (103), (108), and (109), we have 

A8 fc ^ 4ftP, I) [(yE t - v)'u,-] 2 . (110) 

It can be shown from equations (38), (107), and (105) that 

[^ s(p fc , E ft )]' [^ 8(p 4 , E t )j = 4 g' [(yE 4 - v)'u.-] 2 . (Ill) 

Comparing equations (110) and (111) gives 

A£ 4 ^ P*p\Je S(p* , E fc )]'[^ S(P* . E*)j- (112) 

Now we have the conclusion: 

Conclusion 1 . If equation (62) holds, AS t is bounded below by equation 
(112) which shows that A£ t is positive and approaches zero only when 
the partial derivatives (d/dE)S(p* , E t ) approach zero. 

The above is for k t 3Z X . Next we consider k e JC 2 . Since ft = when 
k z 3C 2 , we have E A+1 = E fc . From equations (58) and (22) 

AS fc = 8(p* , E t ) - S(p t+1 , E*), (n3) 

= -2E*(v t - v t+l ), 

where 

v fc = exp (— jp*)v + exp (jp*)v, (114) 

v A+ i = exp (— jp k +i)v + exp (jp t+1 )v. (115) 
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From equation (56) and the two equations above, 
v* = v*+j = Si - cos or* — S(p t , E t ) |[exp (— ;p*)m+ exp (jp k )v] 

- jsin a k — S(p t , E k ) [exp (—;>*)» - exp 0*P*)v]. 

(116) 

The vectors y and v are denned in equation (37) and their elements 
are defined in equations (34) and (35). It can be seen from equations 
(34) and (35) that the elements of y and v are complex numbers. Let 
u be written as 

v - K + i« (H7) 

where the elements of £ and C are rea l numbers that can be determined 
from equation (34). Comparing equation (35) with equation (34), we 
see that the elements of v are complex conjugates of those of y. Hence 
v can be written as 

v = 5 - ft. (118) 

Substituting equations (117) and (118) into equation (116) and then 
substituting the resulting equation into equation (113), we obtain after 
some steps, 

AS* = 4Jcos \a k j- &( Pk , E t )J - l||Etf cos Pk + E^sin Pk ] 

- 4 sin \a k j- S,( Pk , E k )\ [EU cos Pk - E^sin pj. (119) 

From equations (36), (117), and (118), 

— £(p t , E t ) = 2jE t (exp (—;>*)» — ex P O'pOv), 
op 

= 2iEr(-2isin Pt ? + 2j cos p*C), 

= 4[E^sin Pk - EK cos pj. (120) 

It is clear from equation (56) that, if {d/dp)&{p k , E*) = 0, then p k + x = 
p k and from equation (115), A8 fc = 0. So we need to evaluate AS* only 
for (d/dp)S(p k , Ej) ^ 0. It can be seen from equation (120) that, when 
(d/dp)£,(p k , Eft) p* 0, EJ £ and E£< cannot be simultaneously zero. Hence, 
we can define a quantity 
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* = tan_1 iff (121) 

and a quantity 

c = mir + (rot* > o. (122) 

From equations (121) and (122), equation (120) can be rewritten as 

j- p S( Pt , E k ) = 4C sin (p 4 - 0). (123) 

Substituting equation (123) into equation (119), using equations (121) 
and (122), and rearranging, we obtain, after a number of steps, 

AS* = 4Cfcos [p k - <f> - 4Ca* sin ( Pk - <j>)] - cos ( Pk - <p)} (124) 

or 

AS* = 8C sin [ Pk - <j> - 2Ca k sin ( Pk - 0)] sin [2Ca k sin ( Pk - 0)]. (125) 

It may be assumed that ^ Pk — <p ^ 2r (a factor of 2t or its multiple 
can be dropped). Since (d/d P )£,( Pk , E k ) ^ 0, Pk — $ cannot be 0, ir, or 
2ir [see equation (123)]. Hence, we have either 

< Pk - 4> < ir (126) 

or 

ir < Pk - < 2tt. (127) 

We shall consider equation (126) first. 

As can be seen from equation (56), the constant <x k determines the 
size of each adjustment (note that a k > is required so that the ad- 
justments will be made in the negative gradient direction). We wish 
to determine the permissible range of a k , that is, to determine a number 
5 such that AS* > for every a k in the range < a k < 8. It can be 
shown from equations (124) and (126) that 5 can only be as large as 
1/2C; hence, the permissible range of a k is 

< a„ < ~ (128) 

From equations (128) and (126), we obtain 

< 2Ca k sin ( Pk - <p) < 1. (129) 

It can be easily shown from equation (129) that 

sin [2Ca k sin ( Pk - <f>)] > - 4Ca k sin ( Pk - 0) > 0. (130) 

IT 
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It can be shown from equations (126) and (128) that [p k — <p — 2Ca k sin 
(fit — <f>)] must be in the range 

< [ Pk - - 2Ca k sin ( Pk - if)) < | (131) 

or in the range 

| < [p t - * - 2Ca* sin (p t - 0)] < t. (132) 

When equation (131) holds. 

2 

sin [ Pk - <f> - 2Ca k sin ( Pk - <t>)] ^ - [p* - - 2Ca* sin ( Pk - 0)]. 

(133) 

Combining (133) and (130) and using equations (125) and (123), we 
obtain, after some simplification, 

AS fc > %* [2 - 4Ca*][^ &( Pk , BOj"- (134) 

It can be shown in a similar manner that when equation (132) holds, 

AS 4 >^[^S( Ptl E,)] 2 - (135) 

It has been shown from equations (126) and (128) that either equation 

(134) or equation (135) holds. In a similar manner, it can be shown 
from equations (127) and (128) that either equation (134) or equation 

(135) holds. Thus, we have the conclusion: 

Conclusion 2. When equation (128) holds, either equation (134) or 
equation (135) holds. Physically this means that, when equation (128) 
holds, A8 t is positive and approaches zero only when (d/dp)S(p* , B») 
approaches zero. 

It is not easy to determine whether equation (128) is satisfied in 
practice because the constant C depends on E k , ?, and C and is some- 
what difficult to compute. Hence, we wish to replace equation (128) 
with inequality (63) in Proposition 2. 

From the definition of v k in equation (114) and from equations (117) 
and (118), we can write 

2EIV* = 4[cos Ph ]Eti + 4[sin Pk )EU. (136) 

From equations (136) and (22), one can show that 
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U - S(j>k , E,) = 4Etf cos Pk + 4EJC sin Pk (137) 

where 

U = EfrE* + f) ql. 

A — -co 

Note that the terms EfrE* , ££_„„ Ql , E& and EXC in equation (137) 
are independent of p. Furthermore, equation (137) holds for all p k . 
Letting p k = in equation (137), we obtain 

L k - [S(p* , E 4 )]„-o = 4Ei? (138) 

where [£.(p k , E t )],»-o is £(p* , E*) evaluated at p k = 0. Since 8(p k , E k ) 
^ for all p k , we have from equation (138) 

L k ^ 4E& (139) 

It can be shown from equation (22) that 

[SfofBJltt-a < 2L». (140) 

From equations (140) and (138), 

-L k < 4Etf. (141) 

From equations (139) and (141), 

<R0" ^ &L 3 t . (142) 

We have obtained equation (142) by letting p k = in equation (137). 
If we let p k = tt/2 in equation (137), we obtain, in a similar manner, 

(EK) 2 ^ -hV k . (143) 

From equations (142), (143), and (122), 

C ^ ^i L > • C144) 

Using sampling theory, we can verify that 



U = 2/ 



f 8*(t) dt] + 2/„ f q\t) dt (145) 

V-00 J E l •'-co 



where [/?„ s 2 («) <ft] Bl is /!? w s 2 (i) dt evaluated at E = E* . 

It can be seen from equations (144) and (145) that, if equation (G3) 
holds, equation (128) is satisfied. Thus, we have the conclusion: 

Conclusion 3. When equation (63) holds, equation (128) holds and, 
from Conclusion 2, either equation (134) or equation (135) holds. Hence, 
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when equation (63) holds, AS* is positive and approaches zero only when 
(d/dp)&(p k , Ea) approaches zero. 

Now, let us summarize the results presented in this appendix. It has 
been shown that when equation (62) holds in 3C, , AS* is bounded below 
by equation (112) (see Conclusion 1). It has also been shown that when 
equation (63) holds in 3C a , AS* is bounded below by either equation 
(134) or equation (135) (see Conclusion 3). It is seen from the lower 
bounds in equations (112), (134), and (135) that AS* is positive and 
approaches zero only when the partial derivatives approach zero. Since 
3d and 3C 2 each contain an infinite number of fc's and since the mean- 
square error S cannot reduce without bound, S must converge to a 
stationary point. This proves Proposition 2 in Section VI. 
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